618 results found.
Written
Lexicon,
Language Type:
Bilingual
Languages:
German Swiss German
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Size:
11248 words
Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
- Paper title: Modeling Dialectal Variation for Swiss German Automatic Speech Recognition
- Paper track: 9.1 Lexical modeling (lexicon learning, units, mor/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Abbas Khosravani | Swisscom Dictionary of spoken and written Swiss German | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
Not Available
License:
Size:
53.8 MByte
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: Speaking Corona? Human and Machine Recognition of COVID-19 from Voice
- Paper track: 3.7 Perception of paralinguistic phenomena/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Pascal Hecker | COVID-19 listening perception dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Machine Learning
- Paper title: Neural Text Denormalization for Speech Transcripts
- Paper track: 10.4 Rich transcription/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Benjamin Suter | ParaCrawl | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Bulgarian Croatian Czech French German Mandarin Polish Portuguese Spanish Thai Turkish
Availability:
From Data Center(s)
License:
ELRA
Size:
18.7 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
- Paper title: Zero-shot Cross-Lingual Phonetic Recognition with External Language Embedding
- Paper track: 8.11 Cross-lingual and multilingual/accent aspects/Poster Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Heting Gao | GlobalPhone | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Catalan Chinese Dutch Estonian French German Indonesian Italian Japanese Latvian Mongolian Persian Portuguese Russian Slovenian Spanish Swedish Tamil Turkish Welsh
Availability:
Freely Available
License:
CC0
Size:
2880 hours
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: CoVoST 2 and Massively Multilingual Speech Translation
- Paper track: 12.1 Spoken machine translation/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Juan Pino | CoVoST 2 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German
Availability:
Freely Available
License:
CreativeCommons
Size:
10 hours
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Lost in Interpreting: Speech Translation from Source or Interpreter?
- Paper track: 12.1 Spoken machine translation/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Matúš Žilinec | ESIC | /N |
Documentation:
documentation in English
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic English Farsi French German Hindi Japanese Korean Mandarin Russian Spanish Tamil Vietnamese
Availability:
From Owner
License:
LDC
Size:
46 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2003 NIST Language Recognition Evaluation | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Multilingual
Languages:
Arabic Bengali Dari English German Hindi Iranian Persian Japanese Korean Mandarin Chinese Persian Russian Spanish Standard Arabic Tamil Thai Vietnamese Yue Chinese
Availability:
From Owner
License:
LDC
Size:
66 hours
Production Status:
Existing-used
Use:
Language Identification
- Paper title: Modeling and training strategies for language recognition systems
- Paper track: 4.1 Language identification and verification, lang/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Raphaël Duroselle | 2007 NIST Language Recognition Evaluation Test Set | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
see readme
Size:
15 GByte
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: NISQA - A Deep CNN-Self-Attention Model for Multidimensional Speech Quality Prediction with Crowdsourced Datasets
- Paper track: 5.11 Speech and audio quality assessment/Oral Presentation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gabriel Mittag | NISQA Speech Quality Corpus | /N |
Documentation:
https://github.com/gabrielmittag/NISQA/wiki/NISQA-Corpus
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
German
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
State of mind prediction
- Paper title: Modeling user context for valence prediction from narratives
- Paper track: 3.3 Automatic analysis of speaker states/Oral Presentation
- Paper status: Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aniruddha Tammewar | ulm-state of mind corpus | /N |
Documentation:
None




